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ABSTRACT 

The  optimal  time-sequential  distribution  of  supporting 
fire  against  enemy  ground  units  in  combat  against  attacking 
friendly  units  is  studied.   Lanchester- type  models  of  war- 
fare are  combined  with  optimal  control  theory  in  this  in- 
vestigation.  The  optimal  time-sequential  fire-support 
policy  is  characterized  for  a  specific  problem.   Although 
complete  details  for  the  determination  of  the  optimal  policy 
are  not  given,  it  is  conjectured,  on  the  basis  of  the 
theorems  which  were  proved,  that  for  this  problem  the  optimal 
policy  is  to  always  concentrate  all  supporting  fire  on  the 
same  enemy  unit  (until  supporting  fire  must  be  lifted). 
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I.   INTRODUCTION 

The  problem  of  fire  support  allocation  is  a  problem  of 
interest  to  the  military  tactician  and  planner.   In  this 
paper,  we  will  study  the  problem  of  artillery  fire  support 
allocation  against  several  enemy  ground  units.   The  problem 
is  to  determine  an  optimal  time-sequential  policy  for  dis- 
tributing available  fire  support  without  wasting  it.   By 
wasting  of  fire  support  we  mean  overkilling,  i.e.,  the 
destruction  of  enemy  forces  that  does  not  contribute  to  the 
attainment  of  the  friendly  objective.   Overkill  would  cor- 
respond to  a  state  variable  which  represents  the  strength 
of  a  ground  unit  becoming  negative.   We  do  not  want  to  over- 
kill because  it  will  be  simply  wasting  the  effort. 

To  determine  a  "good"  allocation  of  fire  support,  one 
must  consider  the  dynamic  nature  of  combat  in  which  opposing 
forces  trade  casualties  over  a  period  of  time.   Lanchester 
type  models  of  warfare  (see  [15])  have  been  developed  to 
provide  insights  into  the  dynamics  of  combat.   Although  com- 
bat attrition  certainly  is  a  random  process,  such  a  formula- 
tion is  not  mathematically  tractable  as  far  as  analytic  work 
is  concerned.   Therefore,  we  will  consider  deterministic 
Lanchester- type  models  in  our  work  here.   These  are  differen- 
tial equation  models  of  the  combat  attrition  process.   With 
such  a  dynamical  system  model  and  considering  that  fire  sup- 
port decisions  are  made  over  a  period  of  time,  one  is  led 


to  an  optimal-control/differential-game  formulation  for  the 
determination  of  optimal  time-sequential  fire  support 
allocation. 

Such  time-sequential  combat  optimization  problems  may 
be  cast  as  either  optimal  control  problems  or  differential 
game,  depending  on  whether  or  not  both  sides  are  modelled 
as  rational  decision  makers.   In  the  exploratory  work  at 
hand,  we  will  consider  the  optimization  of  combat  decision 
by  only  one  of  the  combatants.   This  leads  to  consideration 
of  an  optimal  control  problem.   Such  a  problem  has  been 
referred  to  as  a  Lanchester-type  optimal  control  problem  by 
Taylor  (see  [14]). 

In  a  deterministic  combat  optimization  problem,  we  usually 
encounter  either  a  static  or  dynamic  situation.   If  the  sit- 
uation.  If  the  situation  is  static,  we  may  determine  the 
optimal  policy  using  the  classical  techniques  from  advanced 
calculus  for  problems  that  are  either  unconstrained  or  sub- 
ject to  equality  constraints.   If  there  are  inequality  con- 
straints in  the  optimization  problem,  the  methods  of  nonlinear 
programming  may  be  used.   If  the  combat  situation  is  dynamic, 
we  may  apply  the  methods  of  the  classic  calculus  of  variations 
for  problems  that  are  either  unconstrained  or  subject  to 
equality  constraints  (see  [4]).   However,  if  inequality  con- 
straints are  involved,  it  may  be  necessary  to  apply  the 
recent  results  of  optimal  control  theory. 

The  development  of  solutions  to  Lanchester-type  optimal 
control  problems  is  not  a  routine  matter  due  to  several  tech- 
nical difficulties.   Due  to  physical  considerations  force 
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levels  cannot  become  negative,  and  we  must  introduce  state- 
variable  inequality  constraints  (SVIC's),  i.e.,  no  state 
variable  (force  level)  can  be  allowed  to  be  negative.   All 
Lanchester-type  dynamic  tactical  allocation  problems  have 
SVIC's.   The  maximum  principle  (in  its  original  form)  is 
inadequate  to  solve  a  problem  with  SVIC's,  since  an  ex- 
tremal may  contain  a  subarc  which  lies  on  the  boundary  of 
the  state  space.   By  an  extremal  we  mean  any  trajectory  on 
which  the  necessary  conditions  of  optimality  are  satisfied 
for  all  t.   When  SVIC's  are  present  in  a  problem,  the 
necessary  conditions  (i.e.,  Pontryagin's  maximum  principle) 
need  to  be  modified  (see  [7]).   To  simplify  considerations 
for  the  problem  studied  in  this  thesis,  we  will  assume  that 
conditions  are  such  that  all  state  variables  are  always 
non-negative.   Thus  SVIC's  are  not  considered  further  in 
this  thesis. 

Another  difficulty  for  this  type  of  problem  is  the  pos- 
sible presence  of  a  singular  subarc.   Consider  a  control 
problem  in  which,  for  example,  a  single  control  variable  u 
appears  linearly  in  the  Hamiltonian  H.   The  maximum  prin- 
ciple fails  to  determine  the  optimal  control  when  SH/3u  =  0 
for  a  finite  interval  of  time.   The  corresponding  trajectory 
is  called  a  singular  subarc .   Use  of  the  terminology  singu- 
lar is  motivated  by  the  fact  that  in  such  problems  the  matrix 
of  second  partial  derivatives  of  the  Hamiltonian  with  respect 
to  the  control  variables  is  a  singular  matrix.   We  will  call 
an  extremal  singular  if  it  contains  a  singular  subarc. 
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We  will  study  the  problem  at  hand  in  two  phases.     In  the  first  phase 
we  will  consider  nonsingular  extremals,    while  in  the  second  phase  we 
will  consider  singular  extremals.     One  result  of  the  study  for  this 
problem  is  that  there  is  no  nice  algorithm  to  determine  the  optimal 
control  for  our  problem.     In  general,    combat  optimization  problems 
dealing  with  multiple  units  are  extremely  complicated,    especially  if 
there  are  some  singular  subarcs. 


II.   ARTILLERY  FIRE  SUPPORT  ALLOCATION  MODEL 

Consider  combat  between  two  ground  forces  Red  and  Blue. 
Each  side  is  composed  of  two  infantry  units.  Blue  infantry 
units  are  being  supported  by  artillery  fire. 

We  want  to  find  the  best  policy  for  allocating  Blue's 
artillery  fire  support  against  Red  to  achieve  the  maximum 
effectiveness.   The  measure  of  effectiveness  is  taken  to  be 
the  ratio  of  Blue  infantry  forces  to  Red  at  the  given  time. 
The  problem  is  to  determine  an  optimal  policy  by  which  the 
ratio  of  two  infantry  forces  at  the  end  of  the  battle  can 

'r 

be  maximized. 
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The  Blue  forces  will  be  denoted  as  X.. -,  and  X1 2  (with 
corresponding  force  levels  x-  ..  (t)  and  x12(t))  and  the  Red 
forces  as  X?1  and  X22  (with  corresponding  force  levels 
x21(t)  and  x22(t)). 

The  following  assumptions  will  be  made  for  developing 
the  model. 

1.  Each  infantry  unit  is  only  in  contact  with  his  op- 
posing unit. 

2.  The  Blue  infantry  units  are  moving  to  contact  with 
the  Red  units  and  due  to  their  movement  cause  negligible 
attrition  to  the  opposing  Red  forces.   Furthermore,  the 
Red  defensive  positions  are  prepared  and  relatively  invul- 
nerable to  small  arms  fire  during  this  approach  phase. 

3.  The  defensive  Red  units  cause  attrition  to  the  op- 
posing Blue  units  according  to  a  "square  law"  attrition 
process . 

4.  The  Blue  artillery  delivers  area  fire  against  the 
Red  infantry. 

With  the  above  assumptions  we  may  develop  the  following 
combat  optimization  model: 


xn(T)      x12CT) 

maximize        J  =  k., ^tv-  +  k~ ?--, 

lx21(T)     2x22(T) 


subject  to:  x ..  ,  ( t )  =  -  a  -.  x  ~  ,  ( t ) 


x12(t)  =  -clxX2(t) 


x21(t)  =  -a2u(t)x21(t) 


x22(t)  ='-C2(l  -  u(t))x22(t)  (la) 

1  1 


where   x..ft):  force  level  at  time  t 
ij  - 

x- . ft) :  the  time  derivatives  of  state  variables  x. . ft) 

T:  time  at  which  the  battle  terminates 

u(t) :  control  variable 

a»  ,  c^:  attrition  rate  coefficients 

k. :  utilities  assigned  to  ratio  of  survivors. 

Since  the  criterion  functional  contains  force  ratios  at 
the  terminal  time,  we  may  formulate  a  problem  with  a  state 
space  of  lower  dimension  by  defining  new  state  variables  as 

xll  x12 

x    "  Xl       x    *  X2 ' 
x21  22 

Then  we  can  show  that  (la)  is  equivalent  to  the  following 
optimal  control  problem: 


maximize        J  =  k-X..  (T)  +  k?x?(T) 
subject  to     x1  =  -a..  +  C-.UX.. 

x-  =  -a~  +    c~(l-u)x?  (lb) 

where  all  constant  coefficients  remain  same  as  (la) . 

Let  us  assume  that  we  are  given  the  initial  conditions 

x^O)  =  x°  ,    x2(0)  =  x°. 

Let  us  assume  also  that  we  are  given  a  specified  time  T.  which 
is  called  the  duration  of  the  combat.   We  will  assume  that 
the  battle  will  be  terminated  at  time  T  under  one  of  the 
following  conditions: 
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Condition  1:   x..  (T)  =  *2(T)  =  0,  for  some  T<T. 
Condition  2 :   T=T-  . 


13 


III.   DEVELOPMENT  OF  NECESSARY  CONDITIONS  OF  OPTIMALITY 

To  develop  the  necessary  conditions  for  optimality,  let 
us  define  the  Hamiltonian  for  the  problem  (lb) , 


H 


(t,x,p,u)  =  p1(-a1+c1ux1)  +  p2(-a2+c2(l-u)x2) ;     (2) 


p1  and  p?  are  the  adjoint  variables,  corresponding  to  the 
state  variables  x..  and  x~ ,  respectively  and,  satisfying  the 
differential  equations: 


dp1 
~3t 


dp2 
dt 


3H 

3x7  =  "C1UP1 


3H        f1   -, 
35^  =  -c2(l-u)p2 


(3) 


By  Pontryagin's  maximum  principle,  we  can  show  that  the 
nonsingular  optimal  control  will  be  obtained  by  maximizing 
the  Hamiltonian  over  the  control  variable.   Hence  we  may 
observe  that  the  extremal  control  is  then  given  by 


u(t) 


if  S  <  0 

if  S  >  0 
v indeterminate  if  S  =  0, 


r0 

1 


where  S  =  c-.p-.x-.  -  c-p-x,,. 

We  will  call  u  the  extremal  control  function  and  S  the  switch- 
ing function.   The  extremal  control  u  so  defined  may  furnish 
the  global  maximum,  a  local  maximum,  or  only  a  stationary 
values  for  the  criterion  function. 
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Since  the  Hamiltonian  H  is  a  linear  function  of  the  con- 
trol variable  u,  if  the  switching  function  S  becomes  iden- 
tically zero  for  some  finite  interval  of  time,  the  maximum 
principle  fails  to  provide  any  information  about  the  optimal 
control  u.   In  this  case  all  the  admisible  values  of  u 
maximize  the  Hamiltonian;  therefore  theory  of  singular  ex- 
tremals  is  required  to  solve  this  problem.   If  S  =  ■=— ■  is  to 
be  identically  zero  for  some  finite  interval  of  time,  then 
all  its  derivatives  with  respect  to  the  time  must  also  van- 
ish.  That  is,  the  trajectory  remains  on  a  singular  subarc 
if  and  only  if 


o  -  M  =  *   r  an  ,  =  _di  (  m  ,  m   ...  f5) 

U    du        dT  L  3u  J    dt2  L  9u  J  ^ 


From  these  conditions,  we  will  have 

0  =  IS  =  S  =  ClPlXl  '  C2?2X2  (6) 


dT  <  IS  )  =  §  =  -alClPl  ♦  a2c2p2  (7) 


0  "  oT^  C  |S  )  =  §  =  alC^Plu  -  a2c^p2(l-u).         (8) 
From  (6)  -  (8) ,  we  have 

u  =  1 =  s,  (9) 

Cl  +  C2 

defining  s,  the  singular  control. 

The  generalized  Lcgendrc -Clcbsch  condition  is  necessary 

for  singular  subarc  to  yield  a  maximum  return  (sec  [9]). 
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2k 
The  condition  will  be  given  (-l)kJL  [  $-^r-   (  ^  )]  <  0.   For 

dU   j.ZK     dU 

the  problem  at  hand, 

3rd     /•   9H   nt  9  2      ..  n 

3IT  [  cTt^  (  3U  }  ]  =  alClPl  +  a2C2P2  *  °> 

since  p.  >  0.   Therefore,  the  Legendre-Clebsch  condition  is 
satisfied. 

For  the  singular  subarc  S  =  S  =  0;  it  follows  that 

x1  (t)    a- 

We  will  refer  to  the  surfaces  S   defined  by  (10)  ,  and  S  , 
defined  by  (7)  as  singular  surfaces.   Singular  arcs  lie  in 
these.   The  surface  defined  by 

S  =  c1p1x1  -  c2p2x2  =  0 

is  called  the  switching  surface. 

The  surfaces  or  curves  S   and  S   can  be  drawn  in  the  p 

p      x  r 

and  x  space,  respectively,  but  the  switching  surface  S  in- 
volves both  p  and  x  and  we  cannot  visualize  it  easily. 
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IV.   SYNTHESIS  OF  EXTREMAL  CONTROL 

A.   SYNTHESIZING  FUNCTION 

Let  X  be  the  Euclidean  space  on  which  the  state  variables 
x  and  their  associated  adjoint  variables  p  are  defined,  and 
U  be  the  Euclidean  space  for  control  variables.   Suppose  we 
can  find  a  real  valued  function  g:   X-*U  such  that  x  =  f(x,g(x,p)) 
determines  all  the  extremal  trajectories  from  initial  to  ter- 
minal time.   Then  the  optimal  control  function  may  be  explicit- 
ly determined  by  the  function  of  state  variables  and  adjoint 
variables  such  that  the  optimal  control  at  each  time  depends 
only  on  the  points  of  the  space  at  which  the  state  variables 
and  the  adjoint  variables  are  located  at  the  given  time. 
The  function  g(x,p)  will  be  called  a  synthesizing  function 
and  the  explicit  determination  of  such  a  function  is  called 
the  synthesis  of  the  extremal  control,  and  the  explicit  de- 
termination of  the  time  history  of  the  extremal  trajectory 
is  called  synthesis  of  extremal  trajectory. 

Unfortunately,  an  existence  theorem  for  the  synthesizing 
function  has  not  been  proved.   Furthermore,  there  is  no  general 
algorithm  for  the  problem  of  finding  such  a  function.   If  the 
set  of  terminal  values  for  the  state  variables  is  not  explicit- 
ly specified,  it  may  be  extremely  difficult  to  find  a  syn- 
thesizing function.   The  determination  of  the  synthesizing 
function  must  be  done  on  a  case-by-casc  basis,  since  there  is 
no  general  algorithm.   In  the  problem  at  hand  the  terminal 
state  is  not  specified.   However,  we  know  that  the  control 
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depends  upon  the  state  variable  x  and  the  adjoint  variable 
p,  and  it  will  be  determined  by  the  maximum  principle  when 
(x,p)  does  not  belong  to  the  singular  surface. 

The  difficulty  is  that  x  is  only  known  initially  and  p 
is  known  only  at  the  problem's  end  until  we  have  managed  to 
match  x  and  p  with  the  maximum  principle  to  get  a  solution. 

However  we  may  analyze  the  behavior  of  the  switching 
function,  which  is  a  type  of  synthesizing  function.   Although 
it  involves  both  state  and  adjoint  variables  we  can  infer 
its  behavior  as  a  function  of  time,  along  an  extremal,  to 
establish  several  results.   Further  the  trajectory  lies  on 
singular  subarc  for  some  finite  interval  of  time,  we  may 
consider  the  singular  surfaces  S   and  S   as  a  synthesizing 
function. 

B.   SYNTHESIS  OF  CONTROL  FOR  A  TIME  INTERVAL  DURING  WHICH 
THE  CONTROL  IS  CONSTANT 

For  synthesizing  extremal  trajectory,  let  us  consider 

the  constant  control  at  first.   Suppose  we  choose  a  constant 

optimal  control  from  the  beginning  to  the  end  of  the  battle. 

Then  from  the  differential  equation  (lb)  ,  we  will  have  the 

following  solutions. 

i 

For  u  =  1; 

3.      C  t     3. 

xAt)    =   max  [(x°  -  -1  )e  *   +-1,0] 

1    cl         cl 

x2(t)  =  max  [  x°  -  a-t,  0  ]  (12a) 
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c-(T-t) 

Pl(t)  =  k1e 

P2(t)  -  k2  (12b) 

c.CT-t) 
S(t)  =  c1k1x1e        -  c2k2x2  (12c) 

c.(T-t) 
S(t)  =  -a1c1k1e        +  a2c2k2  (12d) 

,  c-(T-t) 
S(t)  =  a^e  L  (12e) 

For  u  =  0; 

x1(t)  =  max  {x°    -  ajt,  0} 

x2(t)  =  max  {(x°  -  -2  )e  2.  +  ^-  ,  0}         (13a) 

Li  L, 


Pr(t)  =  kx 


c2(T-t) 


p2(t)  =  k2e  '  (13b) 

c  (T-t) 
SCt)  =  c1k1xi  -  c2k2x2e  (13c) 

c  (T-t) 
S(t)  =  -a-^^  +  a2c2k2e  (13d) 

c7(T-t) 
S(t)  =  -a2c2k2e  z  (13e) 

c2 
For  u  =  3  = 


cl  +  c2 


al  ,  _at  .  al 


xx(t)  =  max  {(xj  -  -i  )e"L  +  -±  ,  0} 


a2  %.ot  .  a2 


x2(t)  =  max  {(xj  -  -£  )e"L  +  -|  ,  0}  (14a) 

Pl(t)  =  kle^T-^ 

p2(t)  =  k2ea(T_t)  (14b) 
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S(t)    =    (c1k1x1   -   c2k2x2)ea(:T"t:)  (14c) 

S(t)    =    (-a^^    -   a2c2k2)ea(T"t)  (14d) 

>SW    "   cT^Hr  e01*1-^  (a^Jki    -    a.c.k,)  (14e) 

12  > 


where 


clc2 


Cl   +   C2 


We  will  use  the  above  results  to  determine  the  optimal 
policy  for  more  general  cases. 

C.   DEVELOPMENT  OF  THEOREMS  ON  THE  CHARACTERIZATION  OF  AN 
OPTIMAL  POLICY 

By  analyzing  the  behavior  of  switching  function,  we  may 

establish  the  following  theorems. 

THEOREM  1 

Let  E  be  a  maximizing  extremal  which  contains  no  singular 
subarc  for  the  optimal  control  problem  given  by  (lb).   Then 
there  is  at  most  one  switching  in  E. 

Proof: 

By  hypothesis,  since  E  does  not  contain  a  singular  sub- 
arc,  the  optimal  control  can  be  determined  by  Pontryagin's 
maximum  principle,  except  possibly  at  one  point,  as  we  shall 
see.   We  have  got  the  switching  function  from  the  Hamiltonian 
by  means  of  the  maximum  principle, 

S(t)  =  c1p1x1  -  c2p2x2. 
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Then  its  derivative  is 

S(t)  -  a2c2p2  -  aic1?1 
and 

S(t)  =  ajcJpjU  -  a2c*p2(l-u). 
It  follows  that 

S(t]  >  0  if  u  =  1 
S(t)  <  0  if  u  =  0. 

Since  the  trajectory  does  not  lie  on  singular  subarc,  we  will 
have  either  u  =  1  or  u  =  0  at  t  =  0.   Let  us  consider  the  case 
u  =  1  at  t  =  0;  a  similar  discussion  will  hold  if  u  =  0  ini- 
tially.  Since  u  =  1  at  t  =  0 ,  S(0)  >  0. 

1.   Let  us  consider  first  the  cases  in  which  S(0)  >  0. 
There  are  two  sub-cases,  accordingly  as  S(0)  >  0  or  S(0)  <  0. 

Case  1:   S(0)  >  0  and  S(0)  >  0. 

Since  S(0)  >  0 ,  we  must  choose  u  =  1  at  the  beginning  of 

Cl(T-t) 
the  game.   From  (12b),  p-,(t)  =  k1e       ,  P2(t)  =  k2 .   There- 
fore, p-.(t)  is  decreasing  function  of  t  and  P2(t)  i-s  constant. 
Hence  S(0)  >  0  implies  S(t)  >  0  and  S(t)  >  0,  for  all  t  >  0. 
Therefore  S(t)  is  a  monotone  increasing  function,  and  conse- 
quently we  will  have  S(t)  >  0  for  te[0,T].   Hence  no  switch- 
ing occurs. 

Case  2:   S(0)  >  0  and  S(0)  <  0. 

From  equation  (12c)  ,  S  must  have  the  form 

-c1 1 
D(t)  -  -Ae  i      +  B  (15) 
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in  which  A  and  B  are  positive,  so  long  as  u  =1.   But  if  S 
becomes  negative  u  must  become  0  and  S  will  have  a  different 
form. 

Let  x  be  the  value  of  t  for  which  D  =  0.   Consider  also 
S:  S  may  become  zero  at  some  value  t-  <  t  or  S(t)  may  be 
greater  than  or  equal  to  zero.   There  are  three  subcases. 


It  may  be  that  S  >  0  for  all  t  <  x.   In  this  case  S  >  0 
for  all  t,  since  S  >  0  when  u  =  1.   Hence  no  switching  occurs 

Subcase  ii 

It  may  be  that  S  becomes  zero  at  some  time  t1  <  x  and 
Sft,)  <  0.   Then  there  is  a  switching  at  t  =  t-..   From  the 
facts  that  SCt^  =  (^SCt.,)  <  0  and  S  <  0  when  u  =  0  it  fol- 
lows that  t,  is  a  switching  point,  and  that  S  <  0  for  all 
t  >  t1. 

Subcase  iii 

It  may  be  that  t..  in  case  ii  coincides  with  x,  so  that 
SCt,)  =  0  and  S(t..)  -    0  for  some  t.  <  T.   Then  the  maximum 
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principle  does  not  determine  the  control  at  time  t,.   If  we 
choose  u  =  1  at  t  =  t..,  when  S(t.,)  >  0.   It  follows  that 
S(t),  S(t)  are  positive  for  t  >  t.    and  there  is  no  switching. 
If  we  choose  u  =  0  at  t  =  t..,  then  by  the  above  arguments 
S(t)  <  0  for  t  >  t1  and  there  is  one  switching. 

2.   It  may  be  that  S(0)  =  0.   If  S(0)  >  0  then  S(t)  is 
always  positive.   If  S(0)  <  0  then  S(t)  <  0  for  t  >  0  and 
u  ■=  0.   Finally  if  S(0)  =  0  then  we  may  choose  u(0)  arbi- 
trarily.  If  we  choose  u  =  1  initially  then  S  >  0  for  t  >  0 
by  argument  given  in  case  2-iii  and  there  is  no  switching. 
Similarly,  if  we  choose  u(0)  =  0,  S  remains  negative. 

This  takes  care  of  all  extremals  which  do  not  have  a 
singular  arc;  a  nonsingular  extremal  has  at  most  one  switch- 
ing point. 

COROLLARY: 

If  E  contains  a  singular  subarc,  then  there  are  at  most 
two  switchings. 

Proof: 

Suppose  S(0)  >  0  and  there  is  t,  >  0  such  that  S(t.)  = 
S(t..)  =  0.   We  will  show  that  there  are  at  most  two  switch- 
ings for  this  case.   The  case  for  S(0)  <  0  will  be  symmetrical 
to  the  case  of  S(0)  >  0.   Since  we  will  choose  the  singular 
control  at  t1 ,  we  may  assume  that  the  trajectory  will  remain 

on  the  singular  subarc  until  some  time  t2-   If  t2  >  T,  we 

C2 
will  have  only  one  switching  at  t-.  from  u  =  1  to  u  =  — - — . 

cl  c2 

Suppose  t7    <   T.   Assume  that  at  some  time  t-  we  choose  u  =  1 
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when  we  are  on  the  singular  subarc.   Then,  by  the  argument 
given  in  2-iii,  S  will  become  positive  and  remain  positive 
for  all  t  >  t2.   Similarly  if  we  choose  u  =  0,  S  will  be- 
come negative  and  remain  negative.   Thus  we  will  have  at 
most  two  switchings,  one  at  t,  ,  say,  when  the  extremal  enters 
the  singular  arc  and  one  at  t~  when  it  leaves  it. 

THEOREM  2 

Consider  a  path  E1f,  defined  by  the  constant  control 
u=l,  0  <  t  <  t1 ,  and  u=0,  t,  <  t  <  T,  so  that  E10  has 
one  switching  and  no  singular  subarc.   Let  Jlf)  be  the  as- 
sociated criterion  function.   If  X'S  >  a-./c,  and  X-  >  a~T, 
then  there  is  no  maximizing  extremal  of  the  form  E10#   By 
symmetry,  there  cannot  be  one  of  the  form  Eni  in  which 
x£  >  a~/c~  and  X?  >  a..T.   That  is,  a  properly  chosen  constant 
control  is  always  better  than  a  mixed  policy  of  this  type. 

Proof: 

Suppose  we  choose  the  extremal  control  u  =  1  for  te[0,t-.] 
and  u  =  0  for  te[t1,T].   Then  from  (12a)  and  (13a),  we  have 

3.       C  "fc     3. 

x^Ct^  =  max  {(xj  -  ^  )e  X   +  ^  ,  0} 
x-Ct-.)  =  max  {  x£  -  a7ti ) >  0}. 
For  te[t1,T], 

x±(t)    =   max  (xj(t)  -  a1(t-t1),0} 

=  max  {x?  -  —  )e  X  l   +   —  -  aJt-tJ.O} 
1    c1  J  c1     lv    1' ' 
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x2(t)  =  max{(x5(t1)  -  -±   )e  L  i  +  _i  ,  0 

a-    c-Ct-t.)   a? 
=  max{(x°  -  a2t1  -  ^  )  e        +  _£  ,  0}, 

Suppose  x.,(t),  x2(t)  >  0  for  all  te[0,T];  then  the  payoff 
will  be 


J1Q  =  kx[xj   -±)e1   1   +  J^  -    8l(T  -  x^]  +     (16) 


al  ^aCltl  ,  a] 

—  je     +  — 

1  cl 

a9  c,(T-t)    a. 


k2itx2  -  a2ci -4)e  2     +4] 


a,    c,t,    a.,  c2(T-t1) 

j  -  c?R?e 
1  "1 


JIo  "  clkltXi  "  cT  )e  '  '  +  cT  ]  •  C2k2e  2  "   1  Cx5-a2tl) 


a.,    c1t1        cCT-t-,)  a0 

J10  -  clklK  -  4  )e      +  C2k26  *       !X2"22tl)  *  4  ' 


al         X2 
Since  x?  >  —  and  T  <  —  ,  we  always  have  J^n   >   0.   Therefore, 
X~C-|        ~a^  -Lu 

we  will  have  a  maximum  payoff  at  the  boundary  value  of  t, , 
i.e. ,  t-  ■  0  or  t-  -  T. 
If  t1  =  0, 

max  J1Q  =  JQ  =  k^xj-a^)  +  k£  [x»  -  ^  )  e°2   +  *|  ] 
If  tj  =  T, 

max  J1Q  =  J2  =  kl[xJ  -  i  )eClT  +  !l  ]  *  k2 (x°  -  a.T) . 

In  other  words,  we  cannot  improve  the  payoff  by  switching  the 
control.   We  will  have  the  same  result  for  Jn1 ,  for  which  we 
switch  the  control  from  u  =  0  to  u  -  1.   We  may  summarize 
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this  in  the  following  corollary: 

COROLLARY: 

All  nonsingular  controls  which  maximize  the  criterion 
function  for  this  problem  have  the  form  u  =  1  or  else  u  =  0 
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V.   DETERMINATION  OF  THE  OPTIMAL  CONTROL 

A.   SUFFICIENT  CONDITIONS  FOR  OPTIMALITY 

As  is  well  known  the  maximum  principle  only  provides 
necessary  conditions  of  optimality.   Second  order  sufficient 
conditions  (see  [1])  are  impractical  to  apply  to  the  problem 
at  hand.   Mangasarian's  sufficient  conditions  (see  [10]), 
unfortunately  do  not  apply  to  the  problem  at  hand. 

The  following  is  an  adoptation  of  Mangasarian's  Theorem 
to  a  form  more  closely  related  to  the  specific  problem  under 
study.   Consider  the  following  optimal  control  problem. 


Maximize 
U 


F(t,x,u)dt  +  G(xo,tf) 


subject  to  X  -  f(t,x,u), 

g(t,x,u)  >  0, 

the  initial  conditions  C~  (x  )  >  0,  • 

1   o   -   ' 

the  terminal  conditions  C7(xf)  >  0, 

where  x   =  x(0),  xf  =  x(T)  for  specified  T. 

THEOREM: 

Assume  that  F(t,x,u)  and  each  component  of  f(t,x,u)  and 

g(t,x,u)  are  dif ferentiable  and  concave  in  the  variable  (x,u) 

for  te[Q,T],  and  also  each  component  of  C, (x  )  and  C7(xf)  is 

dif  ferentiable  and  concave  in  x   and  x,-  respectively.   Then 

o      f    r       J 

a  control  u(t)  yields  a  maximum  for  problem  (17)  if  there 
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exist  u(t),  x(t),  p(t),  q(t),  r(t) ,  s(t)  such  that; 

T  T 

H(t,x,u,p,q,r,s)  =  F(t,x,u)  +  p  f(t,x,u)  +  q  g(t,x,u) 

+  rC1(xQ)  +  sC2(xf) 

Hx  =  FxCt,x,u)  +  pTfx(t,x,u)  +  qTgx(t,x,u)  +  p(t)  =  0 
Hu  =  Fu^*^'^  +  pTfu(t,x,u)  +  qTgu(t,x,u)  =  0 

GX  ^o5*^  +  7C1^  +  P(0:)  =  ° 
o 

Gx  (*o'*f)  +  sC2Cx£)  -  p(T)  =0 
r  >  0, 
7c1(xo)  =0 
F  >  o 

sC2(xf)  =  0 
q  >  0 

qg(t,x,u)  =  0 
p  >  0. 

However,  for  the  problem  at  hand  (lb)  it  is  readily  seen 
that  the  right-hand  sides  of  the  differential  equation  are 
not  concave  functions  of  the  state  and  control  variables  so 
that  the  above  theorem  cannot  be  applied.   We  will  return  to 
discuss  our  problem. 
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B.   DETERMINATION  OF  OPTIMAL  CONTROL  IN  TERMS  OF  INITIAL 
CONDITIONS  AND  TERMINAL  CONDITIONS 

In  general  we  cannot  determine  the  optimal  control  with- 
out some  computation  and  comparison.   However,  in  some  cases 
we  can  establish  its  form  easily.   In  this  section  two  theo- 
rems are  proved.   The  first  one  establishes  a  sufficient 
condition  for  some  nonsingular  arcs  to  be  optimal.   The 
second  theorem  establishes  a  necessary  condition:   An  opti- 
mizing arc  never  has  a  terminal  singular  subarc. 

THEOREM  5 

al 
If  E  does  not  contain  a  singular  subarc,  and  if  x1  >  — - 

x°        c  k  x°  1 
and  0  <  T  <  min{  — ,  —  In   -K-  1l    },  then  constant  control 

a2   Cl    C2K2X2  a2 

u  =  1  is  the  optimal  control.   By  symmetry,  if  x2  >  —  and 

x°   1     c9k,,x°                   w  2 

0  <  T  <  min{—  —  In      =i  -  „  >>  then  the  constant  control 
^         a.  j  c-.           ciki  l 

u  =  0  is  the  optimal  control. 

Proof: 

al 

Let  E  be  a  nonsingular  extremal  for  which  x1  >  — -  and 

x°  "  1 

—  >  T.   Then,  by  theorem  2,  we  know  that  an  optimal  control 

a2 

is  constant;  either  u  =  0  or  u  =  1.   Now  assume  S(t)  <  0 

for  all  te[0,T]  . 

c?T 
From  (13c),  S(0)  =  -c2k°e    x°  +  c1k1xj  <  0.   Hence 

T  >  _L  in      1, 1  ]    .   Therefore,  by  contrapositive  argument, 

if  T   <    (l/c2)an(c1k1xj/c2k2xp,    then   S(0)    >    0.      Thus    u   =    0 

is  not  an  optimal  control,  consequently  the  optimal  control 

is  u  =  1.   A  similar  argument  establishes  the  symmetrical 

result. 
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THEOREM  4 

A  maximizing  control  does  not  have  a  singular  subarc 
ending  at  t  =  T. 

Proof: 

Assume  that  we  have  a  path  which  has  singular  control  at 
the  end,  say,  at  least  in  an  interval  t,  <  t  <  T.   Let  us 
consider  At  =  T  -  t~    to  be  small.   We  want  to  compare  this 
path  with  paths  which  have  other  controls  in  the  interval 
(t.,T) .   In  particular  we  want  to  compare  the  three  trajectories 
which  have  the  controls  u=  s,  u=  1,  oru=  0,  in  the  final 
interval  (t  T)-. 

Consider  the  Taylor  expansions  of  J  at  t  =  t, . 

dJ  1    d2J 

-at  )tl"At   +  2   C  3t^  ^ 


JS  ■  Vsh,  +  c  "at  >t/At  +  I  c  ar^  )t.*At2  +  °(At3) 


dJ1  -.    d2J., 

Jl  "  V,h.    +  C  -&   )t.-4t  ♦  f  C  ^   )t  -»t«  ♦  O(At') 

1  1  1 

dJ  -    '   d2J 

Jo  =  cjs\  +  c  tf  )tl"At  +  I  c  ar^  \*At2  +  0^At3^  <18> 

Since  we  assume  that  the  trajectory  lies  on  the  singular  sur- 
face at  the  end  of  the  battle,  to  have  singular  control  on  an 
admissible  extremal  at  time  T,  it  is  necessary  that 

S(T)  =  S(T)  =  0. 

From  (6)  and  (7) ,  we  have 

alCl 
alClkl   =    a2C2k2    '    1-e''    k2    ~   a^c~Jkr 
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Also 

al  "  a2 

in  the  interval  of  singular  control.   Then  at  time  t1 ,  this 
condition  must  be  satisfied. 

From  (10) ,  (12a) ,  (13a)  and  (14a) ,  we  may  write  the 
payoff 

a-  +  c9     a1         a1    a(T-t1) 

Js  -  ki  c  -h^1 )  i  4 +  <xi  -  4  ^         i 

where  x*  =  x-ift-,)  >  0 ,  x*  =  x2(t1)  >  0. 

Similarly, 

a-.        c-  axc  .        a-.        cAI-t] 

1  1    L    c1        c-      1  c2  cl 


a1c1  c1  alc1        c2(T-t1) 

0  1    L      c|  1  lv  l-7         *■   c2      1  c|      •* 

Their  derivatives  with  respect   to   t,    are; 

a..         a(T-t1) 

Js  -  -kici  &{  "  ET  )e 

a1         a(T-t.) 

J"  =  k-c-oGtl    -     —  )e  L 

s  11^1  a      J 


a-.c.  a1        c7(T-t..) 

J»    =    -k-    [    -   -A-i  +   Cl(x*    -   —  )e   ^  i    ] 

1  1    L  c?  1K    1        c.    J  J 

a-        cJT-tJ 

a   c  c    (T-t    ) 

J6  ■  -ki  i  -i  +  cci*i  -  4^  ^  i 
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a-        c7(T-t.,) 
Hence, 

\%  Js =  kicic  ^§7xi  '  ai) 


lim  J"  =  kc?(x{  -  i  ) 
t  T   x    x   i      i   c1 

lim  J"  =  k?c1c7(x|  -  — -  ) 
tj+T   u    L    L   L      1   c2 

Since  (J')T  =  (Ji)T  =  (J(pT»  from  (18)  we  have 

Jl  "  Js  =  ¥J1    -   Js^T  "  V2  +  °<T  "  ^3  "  c^Tc^  >   ° 

J0  -  Js  "  hJ0    "  Js^T  "  *1>  +  °^T  "  V  "  cf^c'1  >  °  ' 

Therefore,  J.,  >  J  ,  and  J  >  J  ,  at  least  if  T  -  t-,  is  suf- 
'   1    s '      os'  1 

ficiently  small. 

Thus  we  see  that  if  a  trajectory  ends  with  a  singular 
subarc  at  the  end  of  the  battle,  it  cannot  be  an  optimal 
trajectory. 

From  studying  this  problem  we  feel  that  following  result 
is  true.  .  If  so  it  would  reduce  this  particular  problem  to 
the  two  cases  for  the  optimum,  i.e.,  u  =  0  or  u  =  1.   We 
were  unable  to  prove  this  result  or  to  find  a  counter  example, 
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THEOREM  5 

A  maximizing  arc  never  has  a  singular  subarc  nor  a  corner; 
that  is  the  optimal  control  is  always  either  u  =   1  or  u  =   0, 
depending  on  the  given  conditions. 

(Not  proved.) 
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VI.   DISCUSSION 

Considering  the  theorems  given  above,  we  see  that  if  an 
extremal  does  not  contain  a  singular  subarc,  the  optimal 
control  u*  is  (a)  constant  for  all  te(0,T)  and  is  (b)  either 
0  or  1.   Which  of  these  policies  is  better  may  be  determined 
by  direct  computation  and  comparison  of  values  of  the  cri- 
terion functional  using  the  attrition-rate  coefficients  and 
the  initial  conditions.   Thus,  we  see  that  in  this  case  the 
optimal  time-sequential  fire-support  policy  for  Blue  is  to 
concentrate  all  his  artillery  resources  on  one  of  the  Red 
units  for  the  entire  duration  of  battle.   It  should  be 
pointed  out  that  even  if  an  extremal  contains  a  singular 
subarc,  Blue's  artillery  fire  will  (optimally)  still  be 
concentrated  on  one  of  the  Red  units  at  the  end  of  battle. 
This  is  because  an  optimal  fire-support  policy  has  been 
characterized  as  not  terminating  with  a  singular  subarc. 

We  have  not  been  able  to  show  which  policy  is  optimal 
when  the  trajectory  contains  a  singular  subarc  for  some 
period  of  time  before  the  end  of  the  problem.   Thus,  the 
complete  explicit  determination  of  the  optimal  fire  support 
policy  was  not  accomplished.   However,  the  basis  has  been 
given  for  doing  this  in  the  future. 

It  should  be  pointed  out  that  the  insights  gained  into 
optimal  fire-support  policies  obtained  from  the  model  (1) 
are  no  more  valid  than  the  model  itself.   Thus  the  reader 
should  be  cautioned  that  the  optimal  fire-support  policy 
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determined  here  may  only  apply  to  the  problem  (1).    As 
discussed  by  Taylor  (see  [14]),  a  combat  optimization  prob- 
lem consists  of  the  following  three  parts:   (1)  decision 
criteria,  (2)  model  of  the  planning  horizon  (e.g.  conflict 
termination  condition),  and  (3)  model  of  combat  dynamics. 
For  different  objectives  or  different  combat  dynamics,  the 
optimal  fire-support  policy  may  well  be  different  from  that 
given  here. 
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VII.   CONCLUSIONS 

For  the  problem  considered  in  this  thesis,  the  optimal 
fire-support  policy  is  piecewise  constant  with  at  most  two 
switches.   If  one  never  divides  fire  between  the  two  enemy 
units  (singular  control) ,  then  the  optimal  policy  is  con- 
stant and  either  0  or  1  (i.e.,  concentrate  all  fire  on  one 
of  the  enemy  units) .   Although  complete  details  could  not 
be  worked  out  within  the  time  constraints  for  this  thesis, 
it  is  conjectured  that  this  is  the  optimal  policy  (i.e., 
always  concentrate  all  supporting  fire  on  one  of  the  enemy 
units).   The  reader  should  be  cautioned  that  these  conclu- 
sions may  only  apply  to  the  specific  fire  support  problem 
studied  here. 
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